Picture for Prayag Tiwari

Prayag Tiwari

S-SPPO: Semantic-Calibrated Self-Play Preference Optimization

Add code
Jun 01, 2026
Viaarxiv icon

Mags-RL: Wearing Multimodal LLMs a Magnifying Glass via Agentic Reinforcement Learning For Complex Scene Reasoning

Add code
May 27, 2026
Viaarxiv icon

Closed-Loop Bidirectional Prompting for Adversarial Robustness of Vision Language Models

Add code
May 25, 2026
Viaarxiv icon

HiMed: Incentivizing Hindi Reasoning in Medical LLMs

Add code
May 23, 2026
Viaarxiv icon

Herculean: An Agentic Benchmark for Financial Intelligence

Add code
May 14, 2026
Viaarxiv icon

Concordia: Self-Improving Synthetic Tables for Federated LLMs

Add code
May 11, 2026
Viaarxiv icon

A Novel Automatic Framework for Speaker Drift Detection in Synthesized Speech

Add code
Apr 07, 2026
Viaarxiv icon

Attribution Upsampling should Redistribute, Not Interpolate

Add code
Mar 17, 2026
Viaarxiv icon

MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning

Add code
Jan 15, 2026
Viaarxiv icon

All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection

Add code
Jan 08, 2026
Viaarxiv icon